Goto

Collaborating Authors

 Saint-Louis Region


WolBanking77: Wolof Banking Speech Intent Classification Dataset

Kandji, Abdou Karim, Precioso, Frédéric, Ba, Cheikh, Ndiaye, Samba, Ndione, Augustin

arXiv.org Artificial Intelligence

Intent classification models have made a significant progress in recent years. However, previous studies primarily focus on high-resource language datasets, which results in a gap for low-resource languages and for regions with high rates of illiteracy, where languages are more spoken than read or written. This is the case in Senegal, for example, where Wolof is spoken by around 90\% of the population, while the national illiteracy rate remains at of 42\%. Wolof is actually spoken by more than 10 million people in West African region. To address these limitations, we introduce the Wolof Banking Speech Intent Classification Dataset (WolBanking77), for academic research in intent classification. WolBanking77 currently contains 9,791 text sentences in the banking domain and more than 4 hours of spoken sentences. Experiments on various baselines are conducted in this work, including text and voice state-of-the-art models. The results are very promising on this current dataset. In addition, this paper presents an in-depth examination of the dataset's contents. We report baseline F1-scores and word error rates metrics respectively on NLP and ASR models trained on WolBanking77 dataset and also comparisons between models. Dataset and code available at: https://github.com/abdoukarim/wolbanking77.


Panoptic Segmentation of Environmental UAV Images : Litter Beach

Youme, Ousmane, Dembélé, Jean Marie, Ezin, Eugene C., Cambier, Christophe

arXiv.org Artificial Intelligence

Convolutional neural networks (CNN) have been used efficiently in several fields, including environmental challenges. In fact, CNN can help with the monitoring of marine litter, which has become a worldwide problem. UAVs have higher resolution and are more adaptable in local areas than satellite images, making it easier to find and count trash. Since the sand is heterogeneous, a basic CNN model encounters plenty of inferences caused by reflections of sand color, human footsteps, shadows, algae present, dunes, holes, and tire tracks. For these types of images, other CNN models, such as CNN-based segmentation methods, may be more appropriate. In this paper, we use an instance-based segmentation method and a panoptic segmentation method that show good accuracy with just a few samples. The model is more robust and less


Sentiment Analysis on the young people's perception about the mobile Internet costs in Senegal

Mbaye, Derguene, Seye, Madoune Robert, Diallo, Moussa, Ndiaye, Mamadou Lamine, Sow, Djiby, Adjanohoun, Dimitri Samuel, Mbengue, Tatiana, Wade, Cheikh Samba, Pablo, De Roulet, Munyaka, Jean-Claude Baraka, Chenal, Jerome

arXiv.org Artificial Intelligence

Internet penetration rates in Africa are rising steadily, and mobile Internet is getting an even bigger boost with the availability of smartphones. Young people are increasingly using the Internet, especially social networks, and Senegal is no exception to this revolution. Social networks have become the main means of expression for young people. Despite this evolution in Internet access, there are few operators on the market, which limits the alternatives available in terms of value for money. In this paper, we will look at how young people feel about the price of mobile Internet in Senegal, in relation to the perceived quality of the service, through their comments on social networks. We scanned a set of Twitter and Facebook comments related to the subject and applied a sentiment analysis model to gather their general feelings.


Ontologies-based Architecture for Sociocultural Knowledge Co-Construction Systems

Kaladzavi, Guidedi, Diallo, Papa Fary, Béré, Cedric, Corby, Olivier, Mirbel, Isabelle, Lo, Moussa, Kolyang, Dina Taiwe

arXiv.org Artificial Intelligence

Considering the evolution of the semantic wiki engine based platforms, two main approaches could be distinguished: Ontologies for Wikis (OfW) and Wikis for Ontologies (WfO). OfW vision requires existing ontologies to be imported. Most of them use the RDF-based (Resource Description Framework) systems in conjunction with the standard SQL (Structured Query Language) database to manage and query semantic data. But, relational database is not an ideal type of storage for semantic data. A more natural data model for SMW (Semantic MediaWiki) is RDF, a data format that organizes information in graphs rather than in fixed database tables. This paper presents an ontology based architecture, which aims to implement this idea. The architecture mainly includes three layered functional architectures: Web User Interface Layer, Semantic Layer and Persistence Layer. Introduction This research study is set in an African context, where the main problem is an economic, social development and the means to achieve it. Indeed, after the failure of several development models in the recent decades, theoretical research seems to be turning to the development knowledgebased approaches (UNESCO, 2014). The place of knowledge, science and technology in the current dynamics of growth gives rise to intensify the reflection within the economic field.


Waymo to test self-driving big rig as big week for autonomous trucks continues

The Independent - Tech

The autonomous vehicle division of Google's parent company will start hauling cargo using self-driving trucks, capping a busy week for next-generation shipping technology. Waymo, the driverless vehicle unit of Alphabet, announced a pilot programme that will have self-driving big rigs transport cargo to the company's data centres in Georgia. Several companies are vying to dominate the nascent self-driving vehicle industry, believing the technology will reshape how humans and goods travel. Waymo has already extensively tested autonomous cars intended to ferry people around. "Now we're turning our attention to things as well", the company said in a blog post, noting that driverless trucks pose unique tech challenges.


Self-driving cars attacked by angry San Francisco residents

The Independent - Tech

Technology and automotive companies touting self-driving cars as the future of transportation may have some work to convince San Franciscans, who keep attacking the vehicles. A third of traffic collisions involving autonomous vehicles in 2018 so far featured humans physically confronting the cars, according to data released by California. In one case, a taxi driver exited his cab and slapped the front passenger window of a General Motors Cruise parked behind him. No one was hurt, though the car sustained a scratch. In another case, a pedestrian hurtled across an intersection despite a "do not walk" sign, shouting as he went, and rammed his body into a different Cruise's rear bumper.


Michelle Obama says she uses social media 'like a grown-up' in apparent Trump reference

The Independent - Tech

Michelle Obama took an apparent swipe at Donald Trump's social media habits, saying she uses social media "like a grown-up". "How many kids do you know that the first thing that comes off the top of their head is the first thing they should express? It's like, 'Take a minute. Talk to your crew before you put that [out there] and then spell check and check the grammar,'" the former First Lady said during a panel in New York, according to People. While Ms Obama did not mention the President by name, Mr Trump is known for stream-of-consciousness bursts of tweets that periodically contain grammatical and spelling errors.


Classification approach based on association rules mining for unbalanced data

Ndour, Cheikh, Diop, Aliou, Dossou-Gbété, Simplice

arXiv.org Machine Learning

This paper deals with the binary classification task when the target class has the lower probability of occurrence. In such situation, it is not possible to build a powerful classifier by using standard methods such as logistic regression, classification tree, discriminant analysis, etc. To overcome this short-coming of these methods which yield classifiers with low sensibility, we tackled the classification problem here through an approach based on the association rules learning. This approach has the advantage of allowing the identification of the patterns that are well correlated with the target class. Association rules learning is a well known method in the area of data-mining. It is used when dealing with large database for unsupervised discovery of local patterns that expresses hidden relationships between input variables. In considering association rules from a supervised learning point of view, a relevant set of weak classifiers is obtained from which one derives a classifier that performs well.